HEPATITIS B VIRUS MUTATIONS

The present invention relates to hepatitis B virus (HBV) disease, and in particular provides
polynucleotide sequences which are characteristic of certain disease states. The invention also
provides tests for detecting the polynucleotides. A test involving protein binding interactions with
the polynucleotides is also provided.

Fulminant hepatitis B [FHBV] is a rare but serious complication of acute infection, associated
with massive hepatocellular necrosis and mortality rates exceeding 50%, depending on coma grade.
FHBV pathogenesis has been associated with elevated rates of viral replication and/or
abnormally rapid clearance.

There has been considerable debate as to the relative role of host versus viral factors in the
aetiology of FHBV. Outbreaks have been traced to common sources, suggesting the involvement
of specific viral strains. Furthermore, particular viral variants have been implicated, such as the
precore stop variant A1896, which inhibits production of HBeAg, a protein believed to have
immunomodulatory effects and which may exert a dominant negative effect on HBV replication.
An association has also been found between FHBV and variants in the basal core promoter
(BCP) (Figure 1), which regulates the production of precore and pregenomic RNAs; in particular,
T1762 and A1764 have been implicated. However, as with A1896, these variants have also been found
in non-FHBV sequences and do not occur in all cases of FHBV. Complicating the issue, index
case and contact pairs have been identified where the complete viral genomes are identical, or nearly
so. One study showed division of patients with FHBV into separate classes, rapid (<1-2 weeks)
and slow (>3 weeks), according to rates of clearance of HBsAg and HBV DNA and intervals
between onset of clinical illness and hepatic encephalopathy.

We have tested the hypothesis that there are common patterns of genetic variation or 'motifs'
among FHBV strains. Using statistical and phylogenetic analyses of the X and C genes from
unrelated FHBV cases and contacts, of which 19 are previously unreported sequences, and sequence
information from the enhancer I region, we show that there are strains of HBV associated with this
disease, each with a distinct variant pattern, effect on mortality and speed of progression to FHBV
symptoms.

Thus, we have reviewed HBV sequences associated with fulminant hepatitis disease in a
number of patients. We have been unable to associate any single mutation with the fulminant form
of the disease. According to the present invention, by phylogenetic analysis we have surprisingly
found that certain combinations of unusual nucleotide substitutions (not only relative to the known
respective genotype but which are sometimes also unusual in all genotypes) in particular positions
in the sequence correlate strongly with FHBV disease. The mutations may not in themselves be novel,
but in combination in the respective genotype background they are important. It also appears that
a variety of mutation combinations can lead to a common phenotype. The combinations of particular
nucleotides in certain positions are referred to as "motifs" herein.

WO 98/45421    PCT/EP98/02048

The present invention in one aspect provides a hepatitis B virus polynucleotide for use
in evaluation of a hepatitis B disease state which comprises at least two of the following:

(i) a mutation in the Enhancer I region;

(ii) a mutation in the Negative Regulatory Element region;

(iii) a mutation in the Enhancer II/Core Upstream Regulatory Sequence/Basal Core Promoter
region; and

(iv) a mutation which leads to an X peptide amino acid change to provide a cysteine or methionine
residue;

the mutation being a variation from the normal nucleotide at that position in a respective one of HBV
genotypes A to F. There may be one or more mutations in any of (i) to (iv).

The hepatitis disease states which may be evaluated according to the present invention include
fulminant hepatitis, severe chronic hepatitis, and serologically unusual forms of hepatitis (including
HBsAg negative infection, anti-HBcore negative infection and serologically negative hepatitis, all
of which have been linked to variation in the X peptide or the other regions involved in transcriptional
control as set out above).

Another aspect of the invention provides a hepatitis B virus polynucleotide for use in detection
of fulminant hepatitis B viral infection which comprises at least two of the following:

(i) a mutation at one or more of the following positions: 1050, 1249 and 1250;

(ii) a mutation at one or more of the following positions: 1633 and 1634;

(iii) a mutation at one or more of the following positions: 1653, 1754, 1762, 1764, 1766, 1768,
1809, 1821, 1826 and 1838/9 insertion;

(iv) a mutation which leads to an X-peptide change to provide cysteine or methionine at one or
more of amino acid positions 26, 72, 88, 127 and 130;

the mutations being variations from the normal nucleotide at that position in a respective one of HBV
genotypes A to F. On transmission to a contact these sequences may result in fulminant hepatitis B.
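
The "at least two of (i) to (iv)" condition can be illustrated with a minimal Python sketch. The position lists are copied from the text; the function names, the input format (a list of variant nucleotide positions plus a dictionary of X-peptide changes) and the treatment of the 1838/9 insertion as position 1838 are assumptions made purely for illustration, not part of the described method.

```python
# Hypothetical helper: reports which of categories (i)-(iv) are represented
# in a sample. Positions are numbered from the unique EcoRI site.
ENHANCER_I = {1050, 1249, 1250}                      # category (i)
NRE = {1633, 1634}                                   # category (ii)
ENH_II_CURS_BCP = {1653, 1754, 1762, 1764, 1766,     # category (iii)
                   1768, 1809, 1821, 1826,
                   1838}                             # 1838/9 insertion recorded as 1838 (assumption)
X_PEPTIDE_SITES = {26, 72, 88, 127, 130}             # category (iv)


def categories_hit(variant_positions, x_peptide_changes):
    """Return the set of categories with at least one qualifying mutation.

    variant_positions: nucleotide positions differing from the nucleotide
        typical for the sample's genotype.
    x_peptide_changes: dict mapping X-peptide amino acid position to the
        new residue in one-letter code.
    """
    positions = set(variant_positions)
    hits = set()
    if positions & ENHANCER_I:
        hits.add("i")
    if positions & NRE:
        hits.add("ii")
    if positions & ENH_II_CURS_BCP:
        hits.add("iii")
    # Category (iv) requires a change TO cysteine (C) or methionine (M).
    if any(pos in X_PEPTIDE_SITES and residue in ("C", "M")
           for pos, residue in x_peptide_changes.items()):
        hits.add("iv")
    return hits


def has_candidate_motif(variant_positions, x_peptide_changes):
    """True when at least two of the four categories are represented."""
    return len(categories_hit(variant_positions, x_peptide_changes)) >= 2
```

Whether a position counts as a variant depends on the genotype-specific reference, so the caller is assumed to have compared the sample against the appropriate genotype A to F sequence beforehand.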

The polynucleotide sequences may be DNA or RNA sequences. Sequences are given in the
5' to 3' direction. Sequence numbering is according to the convention established by Galibert, and
numbers from the unique EcoRI restriction site of HBV. The polynucleotide sequence need not be
continuous and may be broken up into contiguous or non-contiguous segments. However, the
segments will, of course, have a common origin.

The polynucleotides and peptides useful for the present invention are generally in isolated
form, isolated from materials from the host patient, and usually isolated from other material of HBV
origin.

Generally, the polynucleotides are fragments of the complete genome, e.g. 20-1000
nucleotides in length, particularly 50-500, especially 100-200. The term "fragment" means a
polynucleotide of length less than the corresponding HBV genotype genome. X peptide fragments
are usually less than the length of the naturally occurring gene expression product.

Certain polynucleotides and peptides are novel and these form a further aspect of the
invention.

The hepatitis disease state may be determined directly by sequencing an HBV sample from
the patient. More conveniently, the polynucleotide motifs may be detected using hybridisation probes
of complementary sequence which are capable of binding thereto under appropriate stringency
conditions (as may be determined readily by experimentation). Generally each probe will be directed
to a portion of the sequence containing a single mutation. However, probes directed against multiple
mutations may also be used. The probes may be labelled with conventional labels, such as radiolabels,
fluorescence labels etc. A particular test format comprises a solid substrate having thereon a series
of bands of characteristic hybridisation probes. The pattern of binding of an HBV sample from a
patient to the bands enables a determination of the presence or absence of motifs characteristic of a
particular disease state to be made.
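
As a rough illustration of what "complementary sequence" means for a probe, the following Python sketch derives the reverse complement of a target segment and checks for a perfect match in a sample strand. The function names and the example sequence are hypothetical, and real hybridisation depends on stringency conditions that this exact-match check ignores.

```python
# Watson-Crick complement for DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def probe_matches(probe, sample_strand):
    """True if the probe is perfectly complementary to some stretch of the sample.

    This is a simplification: it requires an exact match and ignores
    mismatch tolerance, salt and temperature.
    """
    return reverse_complement(probe).upper() in sample_strand.upper()


# Made-up target segment, for illustration only.
target = "AGGTTAAAGGTC"
probe = reverse_complement(target)
```

A probe designed this way binds only the variant it was built from under the exact-match assumption; a panel of such probes, one per variant, corresponds loosely to the banded test format described above.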

Another test format for the detection of polynucleotide motifs characteristic of FHBV
involves conducting polymerase chain reaction (PCR) using primers having a sequence
complementary to the FHBV characteristic sequences given above.

We have confirmed that certain nucleotide motifs in the NRE/BCP/Enhancer II region (a
region having gene expression control properties) lead to enhanced gene expression in an
experimental plasmid (containing a luciferase gene under control of said motif-containing HBV
enhancer region). Enhanced expression and replication of HBV virus in vivo is postulated to lead
to the acute fulminant HBV disease state.

Without wishing to be bound by theory, it is hypothesised that enhanced gene expression
resulting from the motif-containing sequences is due to a reduction of the ability of the
motif-containing control sequences to bind inhibitory proteins present in host cells (i.e. the patient). Normal
inhibition of HBV gene expression is thus reduced. Experimental evidence of altered nucleotide-host
protein binding is given herein.

Thus, the invention also provides a test method for determining binding interaction between
host or viral proteins and HBV regulatory polynucleotides.

The invention also provides certain antigenic X-peptides, antibodies (e.g. monoclonal)
thereto, and immunoassays.

Embodiments of the invention will now be described by way of example only. Sections I and
II are devoted to phylogenetic analysis and increased transcription respectively.

SECTION I 

Statistical and phylogenetic analysis was carried out as follows, to show that certain mutational motifs
were related to FHBV.
METHODS 

Patients and Sequence Data 

Because previous studies have shown that sequences from index and chronic contacts of
FHBV patients are often genetically identical, all chronic [CHBV] and acute [AHBV] contact
cases were treated as FHBV. For the C gene (nucleotide positions 1814-2458) viral sequences from
30 patients linked to FHBV, representing 27 unlinked FHBV episodes, were analysed. Of these 30,
19 (16 FHBV, 2 CHBV, 1 AHBV) are newly reported here and 11 (6 FHBV, 5 CHBV) were
obtained from the GenBank/EMBL/DDBJ sequence databases. For the X gene (nucleotide positions
1374-1838) 26 FHBV sequences representing 25 unlinked FHBV episodes were analysed. Of these
26, 17 (14 FHBV, 2 CHBV, 1 AHBV) are newly reported here and 9 (6 FHBV, 3 CHBV) were
obtained from the sequence databases. With respect to the non-FHBV sequences, 2 X and C gene
sequences are newly reported here while 159 complete C gene sequences and 86 complete X gene
sequences were obtained from the sequence databases or from a recent study (sequence list
available from authors). This produced total datasets of 191 C gene and 114 X gene sequences. All
unpublished sequences have been submitted to GenBank and have been assigned accession numbers
xxxx-yyyy.
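
The dataset totals quoted above follow directly from the stated counts, as the short Python check below shows. The counts are copied from the text; the dictionary layout and variable names are illustrative only.

```python
# Sequence counts as stated in the text.
c_gene = {"FHBV-linked": 30, "non-FHBV, newly reported": 2, "non-FHBV, databases": 159}
x_gene = {"FHBV-linked": 26, "non-FHBV, newly reported": 2, "non-FHBV, databases": 86}

c_total = sum(c_gene.values())
x_total = sum(x_gene.values())

# FHBV-linked sequences split into newly reported and database-derived
# (FHBV + CHBV + AHBV in each group).
c_new, c_db = 16 + 2 + 1, 6 + 5
x_new, x_db = 14 + 2 + 1, 6 + 3

print(c_total, x_total)  # prints: 191 114
```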

Fulminant patient information is detailed in Table 1. Patients were classified as either 'rapid'
or 'slow' progressors to FHBV symptoms (hepatic encephalopathy). Rapid progressors were those
who progressed from stage 0 to stage 4 coma within a week. Sequences from contact cases first
reported here include two symptomless anti-HBe positive carriers (sequences 10 and 21) who fatally
infected two sexual partners each, and one of a pair of sisters, infected simultaneously, one of whom
developed mild acute infection (sequence 7) while the other acquired severe FHBV (sequence 6). Two
patients were serum anti-HDV IgM positive and thus co-infected with hepatitis D virus. None of our
patients were positive for anti-HCV antibodies. In order to assess the incidence of variation in
enhancer I, a GenBank/NCBI/BLAST search identified all published sequences in the region (74
sequences) (list available from authors).
DNA extraction and PCR amplification 

DNA extraction, PCR amplification and sequencing were performed as previously described
(list of primers available from authors).
Phylogenetic Analysis 

C and X gene sequences were aligned with the ClustalW program and phylogenetic trees
constructed using a maximum likelihood method (program fastDNAml). The maximum likelihood
transition:transversion ratio (Ts/Tv) and relative rates of evolution for the three codon positions were
estimated using a likelihood program (SPOT). The parameters so determined are listed in the figure
legends for Figures 2 and 3. In all cases unrooted phylogenetic trees were constructed and then
midpoint rooted for clarity. For technical reasons, there was insufficient FHBV sequence information
available for an analysis of FHBV sequence clustering in the enhancer I region.
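
To make the transition:transversion distinction concrete, the following Python sketch counts both kinds of difference between two aligned sequences. This is a plain pairwise count, not the maximum likelihood estimate referred to above, and the function name and example sequences are hypothetical.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def count_ts_tv(seq_a, seq_b):
    """Count transitions and transversions between two aligned sequences.

    Transitions are purine<->purine or pyrimidine<->pyrimidine changes;
    transversions are purine<->pyrimidine changes. Gaps and ambiguous
    characters are skipped.
    """
    transitions = transversions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == b or a not in "ACGT" or b not in "ACGT":
            continue
        same_class = ({a, b} <= PURINES) or ({a, b} <= PYRIMIDINES)
        if same_class:
            transitions += 1
        else:
            transversions += 1
    return transitions, transversions


# Made-up aligned fragments: A->G and C->T are transitions, T->A a transversion.
ts, tv = count_ts_tv("AACGT", "GATGA")
```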
Relative Rates of Evolution 

In order to determine whether there was any difference in the rate of evolution between
FHBV and non-FHBV sequences, a relative rate test was undertaken (Figure 4). Distances between
FHBV sequences (a) and their nearest non-FHBV neighbor (b) were compared to their nearest
non-FHBV outgroup sequence (c) using the DNADIST (nucleotide) and PROTDIST (amino acid)
programs within the PHYLIP package. 13 and 14 comparisons were made for the X and C genes,
respectively. Tests were also performed for the antigenic regions in the C gene as defined in Carman
et al. 1995 and according to the functional subdivisions in X identified in Yuh et al. 1992.
Analysis at the protein level was limited statistically to the complete X and C genes and the combined
antigenic and combined non-antigenic regions in C. Relative rate tests were also performed on FHBV
sequences with and without the A1896 pre-core stop codon variant (9 and 7 comparisons, respectively)
and on a set of non-FHBV A1896 cases (9 comparisons). In the tests involving A1896, sequences from
G1896 (HBeAg-producing) CHBV cases acted as sequences b and c. In all cases the Wilcoxon Signed
Rank Test was used to test for the significance of any differences in evolutionary rate.
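
One way to read the relative rate test is as a paired comparison: for each triplet, the distance from the FHBV sequence (a) to the outgroup (c) is compared with the distance from its non-FHBV neighbor (b) to the same outgroup. The Python sketch below applies an exact one-sided Wilcoxon signed rank test to such pairs using only the standard library. The pairing scheme, the function name and the example distances are assumptions for illustration; the text does not specify the software used for the test.

```python
from itertools import product


def signed_rank_test(dist_a_to_c, dist_b_to_c):
    """Exact one-sided Wilcoxon signed rank test.

    Tests whether dist_a_to_c tends to exceed dist_b_to_c across paired
    comparisons. Returns (W_plus, p_value). Zero differences are dropped
    and tied absolute differences receive average ranks.
    """
    diffs = [a - b for a, b in zip(dist_a_to_c, dist_b_to_c) if a != b]
    order = sorted(range(len(diffs)), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * len(diffs)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of tied absolute differences.
        while j + 1 < len(order) and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        average_rank = (i + j) / 2 + 1  # mean of 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    # Under the null hypothesis every assignment of signs is equally likely,
    # so enumerate all 2^n assignments (feasible for the small n used here).
    at_least_as_extreme = sum(
        1 for signs in product((0, 1), repeat=len(ranks))
        if sum(r for r, s in zip(ranks, signs) if s) >= w_plus
    )
    return w_plus, at_least_as_extreme / 2 ** len(ranks)


# Made-up distances (arbitrary units) for four triplets.
w, p = signed_rank_test([5, 6, 7, 8], [4, 4, 4, 4])
```

With 13 or 14 comparisons, as in the X and C gene tests, the enumeration covers at most 16384 sign patterns, so the exact distribution is cheap to compute.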

Finally, variants were divided into those that altered the encoded amino acid (nonsynonymous
changes) and those that did not (synonymous changes) and a comparison made between FHBV and
non-FHBV cases using the program INA. The relative rate of synonymous and nonsynonymous
change is a useful indicator of the strength of natural selection.
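
The synonymous/nonsynonymous distinction can be sketched in Python with the standard genetic code. This is a simple per-codon count between two in-frame aligned coding sequences, not the program used in the study, and the function name and example sequences are hypothetical.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_codon_changes(reference, variant):
    """Count differing codons as synonymous or nonsynonymous.

    Both sequences must be in frame and aligned without gaps. Codons with
    characters outside A, C, G, T are skipped. A change to or from a stop
    codon ('*') counts as nonsynonymous.
    """
    synonymous = nonsynonymous = 0
    usable = min(len(reference), len(variant))
    for start in range(0, usable - 2, 3):
        ref_codon = reference[start:start + 3].upper()
        var_codon = variant[start:start + 3].upper()
        if ref_codon == var_codon:
            continue
        if ref_codon not in CODON_TABLE or var_codon not in CODON_TABLE:
            continue
        if CODON_TABLE[ref_codon] == CODON_TABLE[var_codon]:
            synonymous += 1
        else:
            nonsynonymous += 1
    return synonymous, nonsynonymous


# CTG -> CTA keeps leucine (synonymous); TGG -> TAG turns tryptophan into a stop.
syn, nonsyn = classify_codon_changes("CTGTGG", "CTATAG")
```

The second codon in the example mirrors the kind of G-to-A change that creates a premature stop codon, which is why such a variant is always counted as nonsynonymous.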
Analysis of Clustering 

A cluster was defined here as an uninterrupted phylogenetic lineage of epidemiologically
unlinked FHBV sequences. We undertook an analysis to determine whether FHBV sequences form
distinct clusters on the tree more than expected by chance. This was done by treating FHBV status
as an additional character state and calculating the expected number of evolutionary steps given the
number of FHBV cases under a parsimony based model of evolutionary change as implemented
in the program FMAB (Bollyky et al., submitted for publication).
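
The idea of counting evolutionary steps for a two-state character (FHBV or non-FHBV) on a tree can be sketched with Fitch parsimony. The Python code below is not the FMAB program; it is a minimal illustration in which the tree encoding (nested 2-tuples of leaf names), the function name and the toy trees are all assumptions.

```python
def fitch_steps(tree, state_of):
    """Fitch parsimony on a rooted binary tree.

    tree: a leaf name, or a 2-tuple of subtrees.
    state_of: dict mapping leaf name to its character state.
    Returns (candidate_states_at_this_node, minimum_number_of_changes).
    """
    if not isinstance(tree, tuple):
        return {state_of[tree]}, 0
    left_states, left_steps = fitch_steps(tree[0], state_of)
    right_states, right_steps = fitch_steps(tree[1], state_of)
    shared = left_states & right_states
    if shared:
        # Children agree on at least one state: no change needed here.
        return shared, left_steps + right_steps
    # Children disagree: one change is required on this part of the tree.
    return left_states | right_states, left_steps + right_steps + 1


states = {name: "F" for name in ("f1", "f2", "f3", "f4")}
states.update({name: "N" for name in ("n1", "n2", "n3", "n4")})

# FHBV sequences forming one lineage versus FHBV sequences scattered over the tree.
clustered = ((("f1", "f2"), ("f3", "f4")), (("n1", "n2"), ("n3", "n4")))
scattered = ((("f1", "n1"), ("f2", "n2")), (("f3", "n3"), ("f4", "n4")))

clustered_steps = fitch_steps(clustered, states)[1]
scattered_steps = fitch_steps(scattered, states)[1]
```

Fewer steps indicate tighter clustering, which is the sense in which an observed step count below the number expected by chance supports clustering of FHBV sequences.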
RESULTS

Is there genetic similarity between epidemiologically unlinked FHBV cases? While visual
inspection of the X gene (Figure 2) and C gene (Figure 3) phylogenetic trees suggests that FHBV
sequences are clustered, it was necessary to prove this statistically. For the X gene, the distribution
of the 25 independent FHBV sequences on the phylogenetic tree could be accounted for most
parsimoniously by 14 unambiguous evolutionary steps, fewer than the minimum of 21 expected under
a random model of sequence evolution (P = 0.1 x 10"*). In the C gene, the 17 observed unambiguous
evolutionary steps for the 27 independent FHBV sequences were again fewer than the 25 expected if
these sequences were not clustered (P = 0.4 x 10""^). This clustering reflects a high degree of genetic
relatedness amongst FHBV sequences. Generally the same clusters were found in both genes,
although sequences 15, 19, 21 and 23 occupied different positions on the X and C gene trees. Whether
these discrepancies reflect recombination events, as was clear in the case of non-clustered FHBV
sequences 25 and 26, or the effect of rapid evolution associated here with FHBV sequences (see
below), was unclear.

Do the clusters segregate according to sex, age, HBeAg status, viral genotype, or clinical
outcome of their hosts? Table 1 details these associations. Clustering was most strongly linked to
mortality, every cluster of sequences being uniform with respect to outcome (i.e. survival or death)
with 2 exceptions: one was a liver transplant recipient and thus survived, while the other was pregnant
and died. All FHBV cases in individuals who were pregnant, hepatitis D virus co-infected, or over
30 years of age were fatal, with the exception of those patients who received liver transplants.
Clustering was also linked to the speed of onset of hepatic encephalopathy and clearance of HBsAg
and HBV DNA, with every cluster of sequences, with 2 exceptions, being uniform in this regard. The
two patients who were co-infected with hepatitis D virus were found in the same cluster.

Are FHBV sequences characterised by different rates of nucleotide evolution and by changes
in specific regions? Relative to their nearest non-FHBV phylogenetic neighbor, the FHBV sequences
had a significantly higher rate of nucleotide evolution in both the X and C genes (Tables 2 and 3
respectively). In the X gene, this increase was also significant at the amino acid level, and localised
to the BCP at both the nucleotide and amino acid levels. FHBV sequences also had a significantly
higher number of synonymous changes in the X gene, indicating an elevated rate of background
mutation. This relationship is also reflected in the codon position specific weighting ratios determined
for the X gene phylogenetic tree, where first and second codon position changes occur at an elevated
rate relative to third position changes. In the C gene, the elevated rate of evolution in FHBV
sequences was significant at the nucleotide level for the whole gene and for non-antigenic regions.

Are particular variants associated with FHBV sequences and clusters of sequences? No single
unique variant linked all individual FHBV sequences or groups of sequences; this has been reported
previously. However, particular variants characterised FHBV sequences and clusters of
sequences (detailed in Table 4). On the nucleotide level, these included variants in enhancer I
(enhancer I is known to stimulate transcriptional function of both the X gene promoter and the BCP),
variants in the negative regulatory element (NRE) and the BCP region of enhancer II. On the
amino acid level, these included uncharacteristic patterns of otherwise highly conserved cysteine
residues, which would be expected to play a central role in determining the tertiary structure of
protein products of X, and aberrant methionine residues, which might be expected to alter levels of
the three protein products thought to be coded by the X gene and thus have effects on
transcriptional transactivation of HBV.

Many of these nucleotide and protein variants are genotype specific, altering nucleotides
otherwise typical for particular genotypes; these relationships are detailed in Table 5. Among variants
which have been previously suggested to play a role in FHBV pathogenesis, T^^^ and K^^^-j^ were
found to be characteristic of genotype D; 30 and 21 of the 35 genotype D sequences in our dataset
possessed these variants, respectively, and so they were not associated with FHBV.

Were there particular 'motifs', or combinations of variants, which were unique to FHBV? The
FHBV sequences possess particular motifs, or combinations of variants, which distinguish them from
non-FHBV sequences. The combination of either aberrant methionine residues, aberrant cysteine
residues, or one of three identified nucleotide variants in enhancer I with any of a group of notable
variants in the NRE, the BCP, or the A1896 pre-core stop variant was nearly exclusive to FHBV
sequences (Table 6). T^j^ and Ay^^ characterised some clusters of FHBV sequences but were
not by themselves disproportionately represented in FHBV cases. However, where both these two
variants are present with either T^^^ or Aiygg^, 3 of 4 examples were FHBV. With a single exception,
whenever the A1896 variant was found with the aberrant cysteine and methionine residues or enhancer
I variants described here it was in a sequence associated with FHBV.

What effect does the A1896 precore stop variant have on the rate and pattern of evolution in
FHBV sequences? In G1896 FHBV sequences and in the complete FHBV dataset (A1896 plus G1896
FHBV sequences), changes in the complete C gene and in non-antigenic regions accumulated faster
than in non-FHBV sequences. In contrast, A1896 FHBV sequences had significantly higher rates of
nucleotide evolution than non-FHBV sequences in the complete C gene and in both
non-antigenic and antigenic regions, particularly within the anti-HBc / e2 and anti-HBc / e3 epitopes, as
well as significantly higher rates of amino acid evolution in all areas tested. These results for A1896
FHBV also differed from the CHBV A1896 controls, which did not accumulate changes at a
significantly increased rate in non-antigenic regions and differed in the specific antigenic regions
affected (Table 3).

TABLE LEGENDS

TABLE 1: Clinical data and clustering of FHBV cases and contacts.

A cluster was defined as an uninterrupted phylogenetic lineage of epidemiologically unlinked FHBV
sequences. Occasionally there were minor discrepancies in the clusters defined in the X and C genes
due to lack of phylogenetic resolution. In these cases, such as the movement of sequence 19 between
clusters 1 and 6, weight was given to the positioning seen within the X gene, as this is the gene where
the majority of the variants associated with FHBV are described. Those sequences which do not
cluster in the X gene but do in the C gene, either because those sequences were unavailable for the
X gene or because of factors such as recombination or the rapid evolution associated here with
FHBV, are marked with a \. Sequences 25 and 26 are epidemiologically linked and therefore are not
numbered as a cluster. Individual sequences are those which do not cluster with other fulminant
sequences in either gene.

C, X = Core and X genes. M = Male, F = Female. FHBV = Fulminant hepatitis B virus infection; AHBV =
Acute hepatitis B virus infection; CHBV = Chronic hepatitis B virus infection. * Reported here. ** For
chronic contacts, the outcome and course listed are those of the fulminant index cases associated with
that contact case. @ Patient was pregnant.

* Patient received liver transplant. P Patient was hepatitis D virus co-infected. § These sequences
show evidence of recombination, grouping with D genotype sequences in the C gene and with A
genotype sequences in the X gene.

TABLE 2: Relative rate tests for the X gene comparing rates of evolution between FHBV and
non-FHBV sequences

All results are expressed as P = the probability that FHBV sequences are evolving faster than
non-FHBV sequences. Significant results are in bold type. * Nucleotides are numbered from the unique
EcoRI site. * Amino acids are numbered from the start of the X gene. CURS = Core Upstream
Regulatory Sequence, BCP = Basal Core Promoter. Relative rate test results for synonymous changes
were significantly different only for the number of synonymous changes in the X gene between FHBV
and non-FHBV sequences (P = 0.014).

TABLE 3:

Relative rate tests for the C gene comparing rates of evolution in the complete FHBV dataset, A1896
FHBV variants, G1896 FHBV variants, and non-FHBV A1896 variants, all against G1896 non-FHBV
sequences.

All results are expressed as P = the probability that FHBV sequences are evolving faster than
non-FHBV sequences. Significant results are in bold type. * Nucleotides are numbered from the unique
EcoRI site. Relative rate tests for synonymous and nonsynonymous changes were not significantly
different between FHBV and non-FHBV sequences in any areas tested.

TABLE 4: Variants suggested to play a role in FHBV and their distribution in the dataset.
BCP = Basal Core Promoter, NRE = Negative Regulatory Element, Pre-C = Precore region; A1896 =
premature stop codon that inhibits HBeAg production.
* There were a number of variants in the BCP/enhancer II complex whose incidence was unique to
a single sequence or pair of sequences. These were: A1779 (CHBV1), A1790 (CHBV1), Ajyi^ (CHBV1),
Ti„o (HPBC5HK02, HPBC4HST2), Ti8„ (HPBC5HK02, HPBC4HST2), C,„9 (FHBV13),
C„,, (CHBV1), and A,«2, (FHBV1).

± Variation related to FHBV at these sites involves deviations from otherwise highly conserved
genotype nucleotide identity. More than one nucleotide variant therefore describes the relevant
variation at this position (Table 5). § Where applicable. + = That variant seen, - = Other variant seen
at that position, NA = Not available. ** Sequences FHBV15 and CHBV1 have insertions at position
1838 which cause a frame shift.

TABLE 5: Combinations of variants in motifs that may play a role in FHBV cases
Variants or combinations of variants playing a role in FHBV. + = variants observed. - = variants not
observed, n/a = not available, ? = not known. Underlined variants are unique variants observed in that
particular case or in only a few cases. Bold variants occurred outside the genotype context. C^^q is not
a unique variant but is significant in combination with another unusual variant (C1773). * = For case
FHBV-10, no variants were identified and for case FHBV-11, sequence data was not available in the
regions considered except pre-core. Enh = enhancer. BCP = basal core promoter. NRE = negative
regulatory element. CURS = core upstream regulatory sequence.

TABLE 6: Deviations from typical genotype associations at a number of particular nucleotide and
amino acid positions in the X gene.

* All three occurrences of C1773 in genotype A are in FHBV sequences. Variation in the other
genotypes at this location seems to be less restrictive. * Sequences either have a cysteine at position
6 or position 78, depending on genotype. The cysteine at position 78 has been shown to be disulphide
bonded, suggesting this difference is of some importance in determining protein structure.

TABLE 7: Motifs, or combinations of variants, suggested to play a role in FHBV and their
distribution in the dataset.

MOTIF 1 = BCP nucleotide variant + aberrant cysteine residue in X protein.

MOTIF 2 = BCP nucleotide variant + aberrant methionine residue in X protein.

MOTIF 3 = NRE nucleotide variant + aberrant cysteine residue in X protein.

MOTIF 4 = NRE nucleotide variant + aberrant methionine residue in X protein.

MOTIF 5 = NRE nucleotide variant + enhancer I nucleotide variant.

MOTIF 6 = A1896 precore stop variant + aberrant cysteine residue in X protein.

MOTIF 7 = A1896 precore stop variant + aberrant methionine residue in X protein.

MOTIF 8 = A1896 precore stop variant + enhancer I nucleotide variant.

BCP = Basal Core Promoter, NRE = Negative Regulatory Element, Pre-C = Precore region; A1896 =
premature stop codon that inhibits HBeAg production. Only sequences for which complete
information was available for the areas in question are listed. The A1896 total is out of 43 X gene
sequences with the A1896 precore stop variant.

TABLE 8: FHBV Variant Motifs 

* See section on genotype specific FHBV variants. 

TABLE 9: Presence or absence of variants identified in Table 4 in 7 non-FHBV and 15 FHBV
sequences, all of genotype D.

TABLE 10: Clinical data of FHBV and chronic control cases.

M = male, F = female, N/A = not applicable, N/D = not done, +ve = positive, -ve = negative, ++ = highly
positive, IV = intravenous needle

FHBV = fulminant HBV, CHBV = chronic HBV, dot/blot = DNA was detected by dot-blot
hybridisation, ? = not known, * = Titre indicates final dilution at which sample remained positive.

TABLE 11: Primers used for PCR

Table 11 shows the sequences of primers in 5' to 3' direction used for PCR. The underlined sequences
are the restriction sites. Outer = primers used for 1st round PCR, Inner = primers used for nested PCR.

TABLE 12: Sense strand oligonucleotides used for gel shift.

Table 12 shows the oligonucleotide sequences used for gel shift analysis. Only sense strands are
shown here. Nucleotide positions of 5' and 3' ends, numbered from the EcoRI site, are given above
oligonucleotide sequences.

TABLE 13: Luciferase value correlated with variation in the CURS, BCP and pre-core region.
Variants found in the region between 1549-1974, which contains the CURS and the BCP/Enh-II
complex, are considered. The clustering pattern and patient number follow those in the previous study
(Bollyky et al., 1997, submitted). CURS = core upstream regulatory sequence, BCP = basal core
promoter, Enh-II = Enhancer II. CHBV-1 and -2 are infective contacts of fulminant cases. H = high
(luciferase value > 7), I = intermediate (luciferase value 2.0-7.0), N/A = not applicable, N = normal
(luciferase value < 2), Δ = deletion; 1838A1839 and 1838AC1839 denote an insertion of A or AC between the
noted nucleotide positions. The standard deviation is derived from at least 6 replicates for each
construct. Rapid/slow disease progression was a clinical judgement; allocations to categories were
made before these studies were undertaken. Underlined variants are not unique but significant in
combination with another variant(s).
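
The H/I/N banding used in Table 13 can be written as a small Python function. The thresholds are taken from the legend above; the function name is hypothetical, and the units of the luciferase value are whatever the table itself uses.

```python
def luciferase_category(value):
    """Map a luciferase value to the Table 13 bands.

    H (high):         value > 7
    I (intermediate): 2.0 <= value <= 7.0
    N (normal):       value < 2
    """
    if value > 7:
        return "H"
    if value >= 2:
        return "I"
    return "N"
```

For example, under these thresholds a value of 7.0 falls in the intermediate band and a value of 1.9 in the normal band.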
DISCUSSION

We report the clustering of FHBV sequences from epidemiologically unlinked sequences within 
particular viral lineages. Further, these viral cluster relationships seemed to predict death or survival 
in the absence of additional confounding factors for poor prognosis such as pregnancy, coinfection 
with HDV and older age. This is conclusive evidence of a viral genetic basis to FHBV. 

Further, 21 of the 26 X gene FHBV sequences had a combination of variants in the complex
which regulates pregenomic and precore production (the NRE, the BCP region of enhancer II and
the A1896 precore variant), plus one of 3 alterations which might be expected to affect the production
or function of X gene protein products (aberrant methionine or cysteine residues or enhancer I
variants). These combinations, or motifs, were nearly exclusive to FHBV sequences. We propose a
novel concept in studies of HBV: individually, many of these variants are not unique to FHBV
sequences, but their pathogenic significance lies in their interaction.

We hypothesise that these motifs are associated with common functional effects. The significantly
higher rates of nucleotide evolution in the C and X genes and in the HBV genome as a whole (results
to be published separately) suggest that an increased rate of evolution is a hallmark of FHBV. The
elevated rate of synonymous changes in X and of evolution in non-antigenic regions in C are
consistent with this interpretation. Further, we have shown that the BCP region of A1896-associated
FHBV sequences has significantly increased transcription efficiency (Yasmin et al., submitted). This
supports one report of increased replication and encapsidation efficiency shown for an individual
FHBV sequence. Another novel, though related, concept to arise from this work is that specific
variants can be linked to a different clinical outcome if they occur outside of their usual genotype
context. Further, particular genotypes may predispose patients to FHBV: 15 of the 26 X gene FHBV
sequences were of genotype D (out of 35 sequences available for that genotype).

These motifs, however, are clearly not entirely equivalent; the particular variants which
characterise individual clusters of FHBV sequences appear to result in their distinct clinical features.
A fatal outcome was associated with every FHBV sequence with either the T^^o or A^y^ variants. In
two other studies, 3 of 4 and 4 of 5 such cases were fatal. In a recent computer modelling study,
these variants have been suggested to lead to alterations in RNA superstructure encompassing the
encapsidation signal loop from which DNA replication is initiated and are part of the TATA
complex involved in initiation of precore and pregenomic transcripts. Along with positions 1762
and 1764, position 1773 is part of the TATA complex. Every FHBV case associated with aberrant
C1773 was similarly fatal. Different classes of FHBV are also associated with distinct selective forces,
as the A1896 FHBV sequences are shown here to have different patterns of evolution in the C gene
than FHBV sequences without this variant.

The A1896 variant has been found in both CHBV and FHBV cases, and it has previously been
suggested that the transmission of the A1896 variant and HBeAg negative status results in FHBV
pathogenesis. Building on our recent report of the transmission of the variant, and
presumably HBeAg negative status, on a significant scale (Bollyky et al., submitted for publication),
here we conclusively show that the combination of A1896 with other FHBV associated variants
distinguishes A1896 chronic from A1896 fulminant cases. Lending further support to this hypothesis of
non-equivalent effects of A1896 in FHBV and CHBV cases, we have also shown here that A1896 FHBV
sequences have different patterns of evolutionary change than A1896 CHBV sequences.

The phylogenetic clustering of unlinked FHBV cases, when FHBV is itself a viral transmission
dead end, would suggest that certain viral genetic contexts, such as those defined by strain and
genotype identity, might either be particularly susceptible to acquiring specific FHBV associated
variants or already possess some elements of the FHBV motif combinations. We propose that the
symptomless contact who transmits HBV resulting in fulminant hepatitis has a heterogeneous
population of HBV strains, or quasispecies, characteristic of quickly evolving viruses, some of
which possess all the necessary elements required to cause FHBV in de novo infections by triggering
a massive immune response in the presence of elevated rates of viral replication.

It is clear that looking for a single mutational cause for all cases of FHBV is too simplistic. The
results presented here, based on a large dataset and the application of phylogenetic and statistical
methods, conclusively show that there are multiple sequence routes to FHBV pathogenesis which we
believe share common functional features. While we do not discount that transmission related effects,
co-infection with other viruses, and host factors such as HLA, as shown to be related to viral
clearance, may be involved, we believe that viral sequence is a necessary prerequisite and central
to the causation of fulminant hepatitis B. Greater consideration must certainly be given to the
interactions of multiple viral factors and the context within which specific variants occur. Closer
examination of the multiple routes leading to FHBV promises to shed light on HBV
immunomodulation and transcriptional and translational control.

Figure 1 

Organisation of genes and regulatory regions along the HBV genome. CURS = core upstream
regulatory sequence. BCP = basal core promoter. NRE = negative regulatory element. Pre-C = pre-core
gene. C = core gene. X = X gene. DR = direct repeat (DR1 initiates reverse transcription, DR2
second strand synthesis). The DNA is separated into two strands for clarity (though both strands are
the coding strand). The ruler indicates nucleotide positions numbered from the unique EcoRI site.

Figure 2 

Clustering of FHBV sequences on the maximum likelihood tree of the X gene. For the 191 C gene
sequences the Ts/Tv was determined to be 1.24 with the relative rates of evolution as 0.87:0.62:1.51
at codon positions 1, 2 and 3, respectively. Lineages which possess FHBV status are indicated.
Horizontal branch lengths are drawn to scale. Clusters of sequences, defined as FHBV lineages
which have no intervening non-FHBV sequences, are numbered in bold. Genotype of each major
branch is given. Genotype E is indistinguishable from genotype D within the X gene, as has been
noted previously 19. 

Figure 3

Clustering of FHBV sequences on the maximum likelihood tree of the C gene. For the 114 X gene
sequences the Ts/Tv ratio and the codon-specific relative rates of evolution were found to be 1.28
and 1.49:1.04:0.50 at codon positions 1, 2 and 3, respectively. Lineages which possess FHBV status
are indicated. Horizontal branch lengths are drawn to scale. Clusters of sequences, defined as
FHBV lineages which have no intervening non-FHBV sequences, are numbered in bold. Genotype
of each major branch is given.

Figure 4 

A relative rate test compares rates of evolution in two phylogenetically related viruses, a and b, by
comparing their DNA and amino acid sequences to that of a third, reference virus c, which is more
distantly related to both (i.e. it is an outgroup). Because a and b share more recent common ancestry
with respect to sequence c, they should be equally distant to this outgroup sequence if they are
evolving at the same rate.
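
The comparison described for Figure 4 can be sketched in Python. The example below uses Tajima's one-degree-of-freedom relative rate test as one simple instance of a relative rate test; the text does not state which implementation was used, so the choice of test, the function name and the toy sequences are all assumptions made for illustration.

```python
def tajima_relative_rate(seq_a, seq_b, outgroup):
    """Tajima-style relative rate test on three aligned sequences.

    Counts sites where only seq_a differs (m_a) and sites where only
    seq_b differs (m_b) relative to the other two sequences, and returns
    (m_a, m_b, chi2) with chi2 = (m_a - m_b)^2 / (m_a + m_b), 1 d.f.
    Under equal rates, m_a and m_b are expected to be similar.
    """
    if not (len(seq_a) == len(seq_b) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")

    m_a = m_b = 0
    for a, b, c in zip(seq_a, seq_b, outgroup):
        if "-" in (a, b, c):
            continue  # skip alignment gaps
        if a != b and b == c:
            m_a += 1  # change unique to lineage a
        elif a != b and a == c:
            m_b += 1  # change unique to lineage b
        # sites where all three differ, or a == b, are uninformative here

    total = m_a + m_b
    chi2 = (m_a - m_b) ** 2 / total if total else 0.0
    return m_a, m_b, chi2


# Toy aligned sequences (hypothetical, not HBV data).
print(tajima_relative_rate("CCCAAAAAAA", "AAAAAAAAAG", "AAAAAAAAAA"))
```

A chi2 value above 3.84 would correspond to rejection of equal rates at the 5% level for one degree of freedom.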
Figure 5

The complete DNA sequence data from 1000 to 2500 for seven non-FHBV patients (comparison) 
and fifteen FHBV patients (invention), all genotype D, showing the mutations from the normal at 
various variant positions. 

Figure 6

Diagrammatic representation of various mutational motifs associated with FHBV.

Here we show that the cis-acting sequences from four of the clusters have increased
transcriptional activity in vitro and that the variant promoter sequences have lost nuclear factor 
binding activity. 

PATIENTS AND METHODS
Cases and controls 

In the phylogenetic analysis (Section I), 16 FHB cases, 3 FHB contact cases and 2 AHB cases
were studied. In the functional analysis reported here, we chose 11 FHB or FHB contact cases from
that study and 7 additional chronic hepatitis (CHB) controls not associated with FHB following
transmission. Clinical and demographic details are given in Table 10. We grouped contacts with FHB,
having shown previously that their HBV sequences cluster phylogenetically. Figure 7 gives the
sequences of all the subjects from the beginning of the core upstream regulatory sequence (CURS)
to the end of enhancer II (nt 1643-1849), indicating the variants we defined previously as being more
common in FHB than in chronic cases, taking into account the genotype-specific variability. We have
included two additional laboratory standard control sequences of subtypes adw2 (genotype A) and
ayw (genotype D).

DNA extraction, PCR, sequencing and sequence analysis 

In brief, 25 ul serum was digested overnight in 1 mg/ml proteinase K at 37°C. DNA was
extracted once with phenol/chloroform (1:1) and once with chloroform, and precipitated in ethanol.
Finally, the DNA was resuspended in 15 ul dH2O. For PCR amplification, 5 ul extracted DNA was
amplified with 25 pmol each of sense and antisense outer primers. 1 ul of first round product was
used for nested or hemi-nested amplification. A list of primers is given in Table 11. DNA was
sequenced directly using Sequenase version 2.0 (USB, Amersham, UK).

Luciferase assay

Constructs were based on vector pBL (23), which includes the luciferase gene but without its
own complete promoter. Nucleotides 1549-1975 of HBV (generated by PCR using primers C8m &
BC3, Table 11), which include the negative regulatory element (NRE), CURS, BCP, enhancer II
(EnhII) and a portion of the core gene, were cloned into pBL upstream of the luciferase gene. Clones
were sequenced to verify their identity. Luciferase assays were performed as previously described
(24). In brief, 10 ug plasmid DNA was transfected into two 60 mm plates containing 50-70%
confluent HuH7 hepatocyte cells by standard calcium phosphate transfection. To overcome the effect
of variable transfection efficiency, each plate was co-transfected with 0.2 ug pSV vector containing
the β-galactosidase gene. 72 hrs after transfection, cells were lysed with 25 mM Tris phosphate, 10%
glycerol and 0.1% Triton X-100. 50 ul of cell lysate was mixed with 350 ul of assay buffer (20 mM
tricine, 1.07 mM MgCO3-5H2O, 2.67 mM MgSO4, 0.1 mM EDTA, 33.3 mM DTT, 270 mM coenzyme
A, 470 mM luciferin, 530 mM ATP, pH 7.8), and luciferase production was measured either in an
automated luminometer or with the Luciferase assay kit (Promega, Madison, USA) following the
manufacturer's instructions. For each construct, the assay was done at least 3 times; for each assay,
duplicate plates were used; and for each plate, duplicate measurements were taken. The
β-galactosidase assay was performed with the same cell extract. 50 ul of cell extract was incubated with
a buffer (50 mM Na2HPO4, 39 mM NaH2PO4, 1 mM HCl, 1 mM MgSO4, 1 mg/ml ONPG) at 37°C until
a pale yellow colour developed. The OD at 405 nm was measured. The luciferase value for each
experiment was calculated by dividing the mean unadjusted luciferase value by the mean β-gal value,
and the luciferase value for each construct was the mean of all experiments.
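
The normalisation just described can be written out as a short Python sketch. It assumes that the raw luciferase readings and β-galactosidase OD405 readings of one experiment are simply averaged before the ratio is taken; the function names and the example readings are hypothetical and are not data from this study.

```python
from statistics import mean


def experiment_value(luciferase_readings, bgal_readings):
    """Mean unadjusted luciferase reading divided by mean beta-gal OD405."""
    return mean(luciferase_readings) / mean(bgal_readings)


def construct_value(experiments):
    """Mean of the normalised values over all experiments for one construct.

    experiments: iterable of (luciferase_readings, bgal_readings) pairs,
    one pair per independent experiment.
    """
    return mean(experiment_value(luc, bgal) for luc, bgal in experiments)


# Hypothetical readings: three experiments, duplicate plates x duplicate reads.
example = [
    ([1000, 1100, 900, 1000], [0.5, 0.5, 0.5, 0.5]),
    ([2000, 2000, 2000, 2000], [0.5, 0.5, 0.5, 0.5]),
    ([1500, 1500, 1500, 1500], [0.5, 0.5, 0.5, 0.5]),
]
print(construct_value(example))
```

Dividing by the β-galactosidase signal corrects each experiment for transfection efficiency before the experiments are averaged, which is the purpose of the co-transfected control plasmid.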

Preparation of nuclear extract and mobility shift analyses 

Nuclear extracts were made from HeLa cells and HuH7 cells by a method described by Trautwein
et al (25), and stored at -70°C until used; for each experiment, a fresh aliquot was used. Complementary
oligonucleotide pairs corresponding to the variant HBV sequences in Enh-I and Enh-II/BCP complex
(Table 3) were either made using an in-house oligo-synthesizer or purchased from Oligold
(Eurogentec, Seraing, Belgium). Oligonucleotides were annealed to make double-stranded DNA, which was
labelled with [γ-32P]ATP. Labelled oligonucleotide was purified from unincorporated 32P with a
Sephadex G-50 spin column (Pharmacia, Uppsala, Sweden). For gel shift experiments, 3-5 x 10^4 cpm
of labelled oligonucleotides were mixed with 4 ul of binding buffer (final concentration: 25 mM
HEPES pH 7.6, 5 mM MgCl2 and 34 mM KCl), 1 ug poly dIdC, 2 ug BSA, 2 ul proteinase inhibitor
cocktail (20 mM DTT, 2 mM PMSF) in a total volume of 10 ul and kept on ice. The mix was then
added to an equal volume of dialysis buffer (25 mM HEPES, 1 mM EDTA, 40 mM KCl, 11.4% glycerine)
containing 0, 1 and 3 ug of nuclear extract. The mix was left at room temperature for 15-20 mins
before analysis on a non-denaturing 6% polyacrylamide gel.

HBV core protein distribution in vitro 

To express the core gene with its homologous promoter, nts 1549-2458 were generated by PCR
using C9y & C2y as outer and C8y & C4y as inner primers (Table 11), then cloned into pT7-blue
(Novagen, AMS Biotechnology, Oxon, UK). The PstI-XhoI fragment was recloned into vector
pKLt55 (a generous gift from Dr Walter, Genetics Dept, Glasgow University) whose own promoter
had been removed. To study the effects of core protein sequence variability on intracellular
distribution without any effects from its homologous promoter, a fragment of nt 1818-2458
containing the core gene only was generated by PCR from the pKLt55 construct using primers C5e and
C4h and cloned into vector pRK5 (a generous gift from Prof H Will, Hamburg, Germany), which
contains the SV40 origin of replication and the CMV early promoter. Core constructs under the
control of the homologous promoter were expressed in HepG2 hepatocyte cells and those under the
heterologous promoter were expressed in both Cos7 and HepG2 cells. Briefly, 5 ug plasmid DNA was
transfected into cells on a 16 mm coverslip in a 35 mm dish by the standard calcium phosphate precipitation
method. Cells were harvested 48-72 hrs after transfection, fixed with paraformaldehyde, and labelled with
polyclonal anti-core IgG (Sigma Chemical Company, St Louis, USA) as 1° antibody and anti-rabbit
mouse IgG as 2° antibody. Cells were examined using a Nikon Microphot-SA fluorescence
microscope.

RESULTS

Fulminant hepatitis B virus constructs have a higher transcription level 
A sequence greater than 400 nt containing the NRE, CURS, BCP, Enh II and the 5' end of the
core gene from a total of 20 patients was cloned into a luciferase-expressing vector. Eleven of 20
were FHB; 4 of them were HBeAg negative in the first available samples. Two were laboratory
standard adw (HBV adw2 in Genbank; genotype A) and ayw (Xxhepav in Genbank; genotype D)
subtype controls (both HBeAg producing) and the remaining 7 were CHB controls with well known
clinical characteristics. Table 13 shows the luciferase level correlated with variation in the cis acting
and pre-core regions. Seven of 11 FHB cases, all with variation in the first two AT rich regions of
the BCP (Figure 7), showed substantially higher luciferase activity compared to control adw, which
we considered as background or normal. Six of these 7 cases also had A1896, and one with a G1896
sequence had an A insertion at 1838 and was therefore an HBeAg negative strain. Two of 11 cases
(FHBV-8 and FHBV-9) had an intermediate level of luciferase activity compared to control adw. For
case FHBV-8, 2 clones were tested, one with and the other with but both had variant BCP
regions. Both clones showed the same level of luciferase activity, but this was not consistent (higher
standard deviation). Case FHBV-9 was G1896 and invariant in the BCP, but had a single variant in the
CURS. The last 2 of the 11 FHB cases (FHBV-12 and FHBV-16) had normal luciferase activity, and
contained with almost identical variation in the CURS and BCP. Only one of these variants was
in the third AT rich region and both sequences were invariant in the first two AT rich regions. Six of
7 control CHB cases showed normal luciferase production comparable to adw control. Three of 7
CHB controls had HBeAg-producing sequences (G1896); the remainder were A1896.

Turning to specific variants, of those in the BCP, T1762 (in three) and A1764 (in five) were the most
common in CHB (5 of 7 cases). These variants also were common in FHB associated with high
luciferase expression but always were accompanied by T1766 and/or A1768. This indicates that T1762 and
A1764 alone are not sufficient to affect transcription in our system. However, T1762 (without A1764) with
one other unique variant (A1826) appeared to be crucial in case FHBV-1. The only CHB control case
which had intermediate luciferase activity (I-40) was HBeAg positive and contained a deletion in the
BCP from nt 1754-1762 (Figure 7). The functional effect of combinations of non-unique variants was
further seen in the linkage of variants at nts 1727 and 1740. G1727 and T1740 together were observed
4 times, but only in the group of 7 high luciferase-producing FHB sequences. The control ayw
subtype had higher luciferase activity compared to adw and had considerable differences in the
BCP/enhancer II complex, particularly at positions 1678, 1727, 1740 and 1773 (Figure 7). Con-ayw
had A1727 and C1740; a number of FHB and control CHB cases also contained A1727, but always with
T1740. This is a remarkable parallel with our finding of the 4 FHB cases with G1727 and T1740. l^y^ was
found in 8/11 of the FHB cases with high or intermediate luciferase activity. Three FHB cases had
T1678; 2 of them had high and 1 intermediate luciferase activity; con-ayw, the control with intermediate
activity, was the only control sequence to have T1678. In summary, although the BCP is generally
variable, some unique variants, but more importantly, combinations of variants ("motifs"), were
associated with high or intermediate transcriptional activity, nearly always in FHB cases.

Luciferase activity correlates with disease progression

There was a correlation between high luciferase expression, rapidity of disease progression, and
seroconversion to anti-HBe. All 7 FHB cases with high luciferase expressing sequences had rapid
disease progression regardless of their clinical outcomes (Tables 1 and 4). In contrast, FHB cases
with intermediate or normal luciferase expression had slow disease progression. There was a
correlation between high luciferase expression and HBeAg status in FHB cases. Six of 7 FHB cases
with high luciferase expression were HBeAg negative on admission and seroconverted to anti-HBe;
the other 4 FHB cases with intermediate and normal luciferase expression were HBeAg positive on
admission and 1 of them seroconverted to anti-HBe. In contrast, four of 7 HBeAg negative CHB
cases had normal luciferase expression.

BCP and enhancer I sequences from FHB have different nuclear factor binding patterns to
non-FHB controls

To investigate the effect of BCP variation on binding of transcription factors derived from nuclear
extracts, complementary oligonucleotides spanning nts 1748-1783 from 6 FHB cases with variability
in the BCP and from adw subtype as a non-FHB control were synthesised. As can be seen from
Figure 7, the chosen oligonucleotides included most of the variants previously identified as linked to
FHB. Figure 8 gives the results of nuclear factor binding and Figure 10 compares the sequence to
cartoons of the banding patterns. The non-FHB sequence (OL-adw) shows 3 clear complexes
(Figure 8a), whereas all FHB sequences except one (FHBV-1), which had only one mutation (T1762),
bound poorly, or not at all, with complexes II and III. Case FHBV-1 (Figure 8c), which showed a
similar binding pattern compared to the non-FHB control (OL-adw), had two bands in complex II. In
order to identify liver specific complexes, nuclear extracts from HeLa cells were made, and the
binding assay repeated using oligonucleotides from 3 FHB and 1 control non-FHB cases. From
Figure 9b, it is clear that complexes II and III are not hepatocyte specific. The BCP variant
oligonucleotides showed a similar pattern of binding to both nuclear extracts (Figure 9b, lanes 3, 4 and
5). A competitive experiment using unlabelled oligonucleotides as a cold competitor showed the
specificity of the interactions (Figure 8d).

A number of FHB sequences had variants in Enh-I, particularly TjosqC, G1249C and Tj^so- Four 
pairs of oligonucleotides with or without Enh-I variants were synthesised (Table 12) and bound to
nuclear extracts of HuH7 cells. Figure 11 shows that variant G1050 (oligonucleotide em-1) does not
have any effect on the nuclear factor binding pattern, but variants T1249 and C1250 (oligonucleotide
em-2), in contrast to G1249 and T1250 (oligonucleotide ew-2), do not give rise to complex 3.

Finally, a nuclear factor binding assay was performed using an oligonucleotide with an insertion 
of A at 1838 (Table 12) as detected in two patients (CHBV-1 and FHBV-15). No difference was 
observed in the binding pattern (data not shown).



Expression and distribution of core protein was not different in FHBV.

To investigate the pattern of core protein and its intracellular distribution under the control of the
homologous promoter, an area spanning nt 1549-2458 from 13 patients (8 FHB or contacts of FHB,
4 CHB and 1 AHB) was cloned into vector pKLt55. Core proteins were expressed in HepG2 cells
and labelled with polyclonal anti-core rabbit IgG. The influence of core protein sequence on
intracellular distribution was addressed using core nt 1818-2458 from 11 FHB, 4 CHB and 1 AHB
cases in vector pRK5 under control of the CMV early promoter. The core proteins were expressed
in both Cos7 and HepG2 cells. HBcAg distribution was observed in both cytoplasm and nucleus and
there was no obvious difference in the level of HBcAg expressed in FHB and control cases. A similar
level of expression was observed using either homologous or heterologous promoters (data not
shown).

DISCUSSION

A cis-acting transcriptional regulatory element of approximately 100 bp (nt 1743-1849)
has been shown to function as the BCP (26). This region also contains the 3' end of the X gene
and the 5' end of the encapsidation signal and is sufficient for accurate initiation of both precore and
pregenomic RNAs. A sequence element upstream of the BCP, the CURS, binds to hepatocyte nuclear
factors HNF-3, HNF-4, C/EBP or other transcription factors, and stimulates the activity of the BCP
(27,28,29). Enh-II overlaps the BCP and has a stimulatory effect on the BCP, as well as the SP-I and
SP-II promoters (28,29). Three AT rich regions have been mapped within the BCP region, the first
two of which serve as the initiation sites for two longer pre-core mRNAs, and the third for a shorter
pregenomic mRNA (30). The two most common variants in these AT rich regions are T1762 and
A1764, observed in association with both chronic HBV carriers and FHB cases. Initially they were
believed to be correlated with HBeAg negative phenotype, perhaps acting by downregulating precore
mRNA synthesis (30,31). However, this has not been confirmed by others (32), although there may
be an association with lower levels of HBeAg (32).

One hypothesis to explain the massive liver cell injury in FHB is that altered binding of
transcription factors leads to increased virus replication which, in predisposed individuals, induces
an exaggerated immune response. We have shown clearly that cis-acting regions from four
phylogenetically linked clusters of FHB viral sequences which are epidemiologically unrelated have
enhanced transcriptional activity in vitro. Further, this effect was confined to those FHB-associated
sequences containing variants in BCP as well as A1896. The only exception to this rule was CHBV-1,
a symptomless male contact implicated in causing FHB in successive wives (4). His sequence also
showed high luciferase expression despite G1896, but this was accompanied by an A insertion at nt 1838
and C1862, which lies within the encapsidation signal and might have altered the RNA secondary
structure. FHB cases with sequences (clusters 3 and 7) had normal or intermediate luciferase
levels. Two of these cases with normal transcription activity had variants in the BCP, one of which
was in the third AT rich region (at nt 1794); its effect on transcription is thus unknown, but is unlikely
to be important. It is illuminating that A1896 sequences from chronic carriers not associated with FHB
cases had normal transcriptional activity whilst A1896 containing contacts of FHB cases had levels
similar to A1896-FHB cases. A1896 containing chronic carrier controls also had variants in the BCP,
commonly T1762 and A1764, but in FHB cases these two variants were accompanied by T1766, A1768 or
both. While this manuscript was in preparation, these latter two variants were found, based on
sequence generated from a single patient, to be associated with an elevated rate of replication (33).
It is clear that the two most common variants in the BCP (T1762 and A1764) do not contribute to the
level of transcription unless they occur in combination with other variants. This substantiates the
conclusions from the phylogenetic analysis that there are multiple sequence motifs in FHB-associated
cases which have a common functional outcome and that A1896 itself is not, per se, the major factor
but is a marker for other variation within the cis acting region of the genomes associated with it.
Clearly, this helps to explain why there are large numbers of persons infected with A1896 strains but
very few cases of FHB. As FHB as a clinical entity is rarely, if ever, transmitted, it must also be true
that this increased transcriptional activity is suppressed or otherwise offset in chronic carriers
who have it, as it was found that core promoter variation in chronic patients has no effect on
transcription (34).

The second finding of this work is that, in most cases with increased transcription, binding of one
or more nuclear factors is impaired. Oligonucleotides were synthesised which encompass the regions
in BCP which bind several transcriptional regulatory proteins including a liver specific factor C/EBP
(29). A recent study found that T1762 and A1764 inhibition of binding of nuclear factors (35) is liver
specific, as nuclear extracts from HeLa cells led to fewer bands. In our study, the missing bands were
not liver specific, as nuclear extract from HeLa cells showed the same pattern of binding. Figure 10,
which schematically correlates the variation in the BCP with nuclear factor binding patterns, indicates
that particular nuclear protein binding patterns are associated with BCP variants. The oligonucleotide
containing only one variant (FHBV-1) showed a similar pattern of binding to the non-FHB control,
indicating that T1762 alone is unlikely to inhibit binding of nuclear factors. However, the
oligonucleotide derived from case FHBV-15, which contained one unique variant (€5752) and one
variant out of its usual genotype context (A1757), showed binding inhibition. Oligonucleotides from
other FHBV cases with variants in the BCP showed a similar pattern of inhibition. This indicates that
a number of combinations can lead to lack of binding of transcription factors, with a similar functional
outcome. All of these variant cases are associated with high or intermediate luciferase activity; we
therefore conclude that these two complexes have an inhibitory effect on transcription.

Thirdly, we could not distinguish any differences in HBcAg distribution in liver-derived cells in
culture from FHB compared to sequences from chronic carriers. HBcAg was seen in both nucleus
and cytoplasm. The amount of HBcAg production was qualitatively similar whether homologous or
heterologous promoter systems were employed. In parallel studies (Doman et al, submitted) in
chronic cases, sequential samples showed shifts in distribution of HBcAg from nucleus to cytoplasm.
Clearly, if this is of relevance to the pathogenesis of chronic hepatitis, a different mechanism is
operating in fulminant hepatitis.

Interestingly, there was a correlation between clinical parameters and transcriptional activity. High
transcriptional activities were found in "rapid FHB" and their symptomless contacts. Rapid FHB is
characterized by undetectable viral antigens with rapid seroconversion to anti-HBe and decline into
coma within two weeks of first symptoms and similarly rapid spontaneous recovery. Both
symptomless contacts studied here were implicated in rapidly progressive FHB in two successive
wives each (4). Liver cell necrosis and ensuing rapid clinical deterioration is explained by triggering
a massive immune response which, ultimately, favours clearance of virus. Similarly, early, complete
cessation of virus replication should favour liver regeneration and explain the rapid clinical recovery
and, ultimately, good prognosis associated with this subgroup of patients with FHB. Clearly, high
levels of viraemia are not invariable with high luciferase activities because serum HBV DNA levels
typically were low in such contacts and undetectable in these FHB cases except within the first week
and only by nested PCR. Also, massive liver injury is not inevitable with high transcriptional activity,
as both contacts had symptomless mild chronic hepatitis. In contrast, those patients with "slow FHB"
typically show a more protracted clinical and serological course, over several weeks, and have lower
transcriptional activities than "rapid FHB". Early complete cessation of virus replication is lacking;
seroconversion may be delayed as HBV DNA levels remain detectable for several weeks from first
symptoms.

Figure 7: Sequences from nt 1549 to 1974 were cloned into the vector pBL. Here only the sequence of
CURS, BCP and Enh-II is shown. Patient numbers were followed as for the previous study in
Section I. CHBV-1 and -2 are contacts of FHB cases. All I-numbers denote chronic carrier controls.
BCP=basal core promoter CURS=core upstream regulatory sequence V=insertion X=deletion.

Figure 8: Nuclear extracts were prepared from HuH7 cells. 0. 1 and 3 ug of nuclear extracts were
mixed with 30,000 cpm labelled oligonucleotides (oligos) and run on 6% polyacrylamide
nondenaturing gel. Figure 8A: nuclear factor binding pattern of oligos from adw and CHBV-2. 8B
and 8C: nuclear factor binding pattern of 5 more variant oligos. 8D: 30,000 cpm labelled
oligonucleotide from FHBV-14 was mixed with 1x, 5x and 25x molar excess of unlabelled
oligonucleotide as cold competitor (lanes 3, 4 and 5), added to 3 ug nuclear extract and resolved on
6% polyacrylamide gel; lane 1 = no nuclear extract, lane 2 = no cold competitor.

Figure 9: Nuclear extracts were made from HuH7 cells and HeLa cells. 0.1 and 3 ug of nuclear
extracts were mixed with 30,000 cpm labelled oligonucleotide and resolved on 6% polyacrylamide
non-denaturing gel. Figure 9a shows binding of oligonucleotides derived from con-adw (OL-adw),
CHBV-2, FHBV-14 and FHBV-4 with nuclear extracts from HuH7. Figure 9b shows the same
oligonucleotides binding with nuclear extract made from HeLa cells.

Figure 10: Cartoon representation of nuclear factor binding assay showing luciferase values and
variants in the BCP; and 

Figure 11: Oligonucleotides were made from normal C1050 (OL ew-1), variant G1050 (OL em-1), normal
G1249, T1250 (OL ew-2) and variant T1249, C1250 (OL em-2). Oligonucleotides were mixed with 0. 1 and
3 ug of nuclear extracts from HuH7 cells and run on 6% polyacrylamide non-denaturing gel.
Table 2.

Nucleotide Sequence'

Region
Full sequence            0.024
1373-1631                0.906
(CURS) 1631-1742         0.678
(BCP) 1742-1838          0.009

Amino Acid Sequence

Full sequence            0.050
1-86                     0.255
(CURS) 87-123            0.636
(BCP) 124-154            0.042
Nucleotide Sequence*

Region                       FHBV     G1896 FHBV   A1896 FHBV   A1896 non-FHBV
Full sequence (1901-2458)    0.045    0.022        0.009        0.018
Region 1 (1963-2020)         0.154    0.787        0.076        0.715
CD4 epitope (2050-2107)      0.610    0.361        0.944        0.036
Anti-HBc/e1 (2122-2161)      0.919    0.281        0.141        0.036
Anti-HBc/e2 (2200-2251)      0.724    0.281        0.014        0.281
Anti-HBc/e3 (2290-2305)      0.787    1.000        0.022        0.181
Antigenic regions            0.108    0.052        0.009        0.014
Non-antigenic regions        0.039    0.043        0.013        0.051

Amino Acid Sequence

Region                       FHBV     G1896 FHBV   A1896 FHBV   A1896 non-FHBV
Full sequence                0.108    0.151        0.013        0.013
Antigenic regions            0.162    0.295        0.009        0.014
Non-antigenic regions        0.235    0.052        0.024        0.183
Enh-I


NRE 
1633/1634 


Enh-lI/CURS/BCP 


FHB cases in 








this study 








FHBV-1 


G l(»5l)» Ti24i>, C|2J0 


+/- 


Tl762 A|82fi> 


FHBV-2 


T* 

^ 1249' ^^1250 


+/+ 


- 


FHBV-3 


^IOM)> Tl249» ^1250 


- +/- 




FHBV-4 


|0 »T» 


+/+ 


G1754 


FHBV-5 


Gio.'iO! T1341;, C|25o 


+/+ 




FHBV-6 


Gl»50» T|24tj, C|25n 


+/+ 




FHBV-7 


G|050i T|24€j, C 125(1 


+/+ 




FHBV-8 




-/- 


T A 

* 1762 '^1764 


FHBV-9 


- 


-/- 


C|740 C|773 


*FHBV-10 . 


- 


-/- 


- 


*FHBV-II 


n/a 


n/a 




FHBVI2 


G|()5(h Tij^y. C|25„ 


+/+ 


A|fi53i Cl7«3» G|754 


FHB V 13 


G|05n> Ti24y. Ci25n 


+/+ 


C|773 


FHBV14 


GllLSn* T|24t;, C|25o 


-/- 


T|764 G!76f) 


FHBV15 


n/a 


n/n 




FHB VI 6 




-/- 


Xi703 Xi8oy X1812 


CHBV-1 




-A 




CHBV-2 




-/- 


T|762 ^1764 Ti766 Ap^.s 


AHBV-I 


G|050» T|249. C|25o 


+/+ 


G1754 



FHB cases 
from Genbank 
HBVP3CSX 
HBVP2CSX 
HBVPIPC 

HBVP2PC 
HPBMUT 
HPBC5HK02 

HBVP4PCXX 
HBVP5PCXX 
HBVP4CSX 
HPBC4HST2 



n/a 
n/a 



-/+ 
-/+ 
n/a 

n/a 

-/- 

-/- 

-/- 

-/+ 

-/+ 

-/+ 



n/a 
n/a 

I762» A|764> T|76j, Aj^gg 

'''neZ* ^1764 
C|773 

C|773 
T|7(i„ Anftj, Tnfig 
CLAIMS

1. Hepatitis B virus polynucleotide for use in evaluation of a hepatitis B disease state which
comprises at least two of the following:

(i) a mutation in the Enhancer I region;

(ii) a mutation in the Negative Regulatory Element region;

(iii) a mutation in the Enhancer II/Core Upstream Regulatory Sequence/Basal Core Promoter region;
and

(iv) a mutation which leads to an X peptide amino acid change to provide a cysteine or methionine
residue;

the mutation being a variation from the normal nucleotide at that position in a respective one of HBV
genotypes A to F.

2. Hepatitis B virus polynucleotide for use in detection of fulminant hepatitis B viral infection which
comprises at least two of the following:

(i) a mutation at one or more of the following positions: 1050, 1249 and 1250;

(ii) a mutation at one or more of the following positions: 1633 and 1634;

(iii) a mutation at one or more of the following positions: 1653, 1754, 1762, 1764, 1766, 1768,
1809, 1821, 1826 and 1838/9 insertion;

(iv) a mutation which leads to an X-peptide change to provide cysteine or methionine at one or
more of amino acid positions 26, 72, 88, 127 and 130;

the mutations being variations from the normal nucleotide at that position in a respective one of HBV
genotypes A to F.

3. Hepatitis B virus polynucleotide for use in detection of fulminant hepatitis B viral infection which
comprises at least two of the following:

(i) a mutation which provides one or more of the following nucleotides at the following positions:
G(1050), T(1249) and C(1250);

(ii) a mutation which provides one or more of the following nucleotides at the following positions:

A, T or C (1633) for genotype A or F
T, G or C (1633) for genotype B, C or D
A, T or C (1634) for genotype A, B, C or F
T, G or C (1634) for genotype D;

(iii) a mutation which provides one or more of the following nucleotides at the following
positions:

A(1653), G(1754), T(1762), A(1764), T(1766), A(1768), T(1809), T(1821), A(1826) and
AC (insertion between 1838 and 1839);

(iv) a mutation which leads to an X-peptide change to provide one of the following amino acids
at the following amino acid positions:

C(26), M(88), M(130), M(127), M(141), M(5) and C(72).

4. Hepatitis B virus polynucleotide for use in the detection of fulminant hepatitis B viral infection
according to any of motifs 1 to 5 of Table 7 herein.

5. Hepatitis B virus polynucleotide for use in the detection of fulminant hepatitis B viral infection
having at least two nucleotides in the positions according to any of numbers 1 to 31 in Table 8 herein.

6. Hepatitis B virus X-peptide having one of the following amino acids at the following amino acid 
positions: 

C(26), M(88), M(130), M(127), M(141), M(5) and C(72). 

7. Polynucleotide probe having a sequence complementary to that of any of claims 1 to 5. 

8. Polynucleotide probe having a sequence complementary to a polynucleotide sequence coding for 
an HBV X-peptide of claim 6. 

9. A test kit for detection of an HBV disease state which comprises polynucleotide fragment probes 
capable of hybridising under appropriate stringency conditions to any two of (i), (ii), (iii) and (iv) of 
claims 1, 2 or 3. 

10. A test kit according to claim 7 comprising probes capable of hybridising to all four of (i) to
(iv).

11. Antibody to any of the X peptides of claim 6.

12. An immunoassay for the detection of fulminant HBV which comprises an antibody to any of
the X peptides of claim 6.

13. A test method for determining binding interactions between host or viral proteins and HBV
polynucleotides which comprises:

- taking a polynucleotide fragment according to any of claims 1 to 6;

- applying said host or viral protein thereto; and

- determining the degree of binding between the protein and the polynucleotide fragment.
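
The "at least two of (i) to (iv)" condition of claim 3 can be illustrated with a short sketch. The function names, the dictionary layout and the example genotype below are assumptions made for illustration only and are not part of the specification; the target nucleotides and residues are copied from claim 3. Genotype E is not listed in claim 3(ii), so the sketch never reports category (ii) for it.

```python
# Illustrative sketch only: names and data layout are hypothetical.

# Claim 3 (i): target nucleotides in the Enhancer I region.
ENH_I = {1050: "G", 1249: "T", 1250: "C"}

# Claim 3 (ii): nucleotides that count as a mutation, per genotype.
NRE = {
    1633: {"A": "ATC", "F": "ATC", "B": "TGC", "C": "TGC", "D": "TGC"},
    1634: {"A": "ATC", "B": "ATC", "C": "ATC", "F": "ATC", "D": "TGC"},
}

# Claim 3 (iii): target nucleotides (the AC insertion is handled as a flag).
ENH_II_CURS_BCP = {1653: "A", 1754: "G", 1762: "T", 1764: "A", 1766: "T",
                   1768: "A", 1809: "T", 1821: "T", 1826: "A"}

# Claim 3 (iv): target X-peptide residues.
X_PEPTIDE = {26: "C", 88: "M", 130: "M", 127: "M", 141: "M", 5: "M", 72: "C"}


def matched_categories(nucleotides, genotype, x_residues=None, ac_insertion=False):
    """Return the set of claim 3 categories ("i".."iv") matched by a sample.

    nucleotides:  dict mapping genome position -> observed base
    genotype:     one-letter HBV genotype of the sample
    x_residues:   optional dict mapping X-peptide position -> observed residue
    ac_insertion: True if AC is inserted between positions 1838 and 1839
    """
    x_residues = x_residues or {}
    hits = set()
    if any(nucleotides.get(pos) == base for pos, base in ENH_I.items()):
        hits.add("i")
    for pos, by_genotype in NRE.items():
        base = nucleotides.get(pos)
        if base and base in by_genotype.get(genotype, ""):
            hits.add("ii")
    if ac_insertion or any(
        nucleotides.get(pos) == base for pos, base in ENH_II_CURS_BCP.items()
    ):
        hits.add("iii")
    if any(x_residues.get(pos) == res for pos, res in X_PEPTIDE.items()):
        hits.add("iv")
    return hits


def meets_at_least_two(hits):
    """Claim 3 requires at least two of the four categories."""
    return len(hits) >= 2


# Hypothetical sample carrying G1050, T1249, C1250, T1762 and A1826.
sample = {1050: "G", 1249: "T", 1250: "C", 1762: "T", 1826: "A"}
hits = matched_categories(sample, genotype="D")
print(sorted(hits), meets_at_least_two(hits))
```

Keeping the genotype-specific rule of (ii) in a lookup table makes it straightforward to see which observed bases count as mutations for each genotype without changing the matching logic.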
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Name Sequences 



1001 

HBVORFS TGGGTCTTTT GGGTTTTGCT GCCCCTTTTA CACAATGTGG TTATCCTGCT TTAATGCCCT 

XXHEPAV T. 

HBVGENl A...T. 

HBVAYWMCG T.A , C . .G T. 

HBVDNA T..A G T. 

XXHEPAV • T. 

HPBHBVAA A T 

AHBVl C G 

FHBVl C G 

FHBV2 A G C 

FHBV3 A G 

FHBV4 G 

FHBV5 G C... G 

FHBV6 C G 

FHBV7 G 

FHBV12 G ...C T. ..G.,C A G T. 

FHBVl 3 G 

HBVP3CSX 

HBVP2CSX 

CHBV2 G AA . . A. . .T. 

HPBMUT .A T A 

FHBVl 4 G 

1061 

HBVORFS TGTATGCCTG TATTCAATCT AAACAGGCTT TCACTTTCTC GCCAACTTAC AAGGCCTTTC 

XXHEPAV A G..G . .G T 

HBVGENl G G 

HBVAYWMCG A.. , G 

HBVDNA A...G 

XXHEPAV A G. .G . .G T 

HPBHBVAA ... A G. : 

AHBVl G 

FHBVl A ..G 

FHBV2 1 G 

FHBV3 A.. G T 

FHBV4 A G 

FHBV5 A G : 

FHBV6 G 

FHBV7 G A G 

FHBV12 ...A G 

FHBV13 A G 

HBVP3CSX G A T 

HBVP2CSX G A T 

CHBV2 C . . C . . . a;-. . ... A 

HPBMUT A 

FHBV14 A G 

1121 

HBVORFS TGTGTAAACA ATACCTGAAC CTTTACCCCG TTGCTAGGCA ACGGCCAGGT CTGTGCCAAG 

XXHEPAV CC . . 

HBVGENl T CC 

HBVAYWMCG CC 

HBVDNA .AA G.*. .A C . . T 

XXHEPAV :..CC 

FIGURE 5-1 

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HPBHBVAA CC C 

AHBVl C C. . .CC 

FHBVl CC 

FHBV2 CC 

FHBV3 CC 

FHBV4 G CCA 

FHBV5 CC 

FHBV6 C. . .CC 

FHBV7 CC 

FHBV12 CC 

FHBVl 3 CC 

HBVP3CSX 

HBVP2CSX 

CHBV2 T CC 

HPBMUT C CC 

FHBV14 CC W 

1181 

HBVORFS TGTTTGCT5A CGCAACCCCC ACTGGCTGGG GCTTGGTCAT GGGCCATCAG CGCATGCGTG 

XXHEPAV 

HBVGENl 

HBVAYWMCG 

HBVDNA CATA 

XXHEPAV 

HPBHBVAA 

AHBVl 

FHBVl A... A... 

FHBV2 CG A 

FHBV3 

FHBV4 

FHBV5 , 

FHBV6 

FHBV7 T : T 

FHBV12 ...C.TG 

FHBV13 C 

HBVP3 CSX 

HBVP2CSX 

CHBV2 

HPBMUT G 

FHBV14 

1241 

HBVORFS GAACCTTTCT GGCTCCTCTG CCGATCCATA CTGCGGAACT CCTAGCCGCT TGTTTTGCTC 

XXHEPAV G 

HBVGENl C 

HBVAYWMCG G 

HBVDNA GA 

XXHEPAV G 

HPBHBVAA A 

AHBVl TC 

FHBVl .TC T t 

FHBV2 TC 

FHBV3 GC 

FHBV4 TC 

FHBV5 TC 

FHBV6 TC 

FHBV7 TC 

FHBV12 TC 

FHBVl 3 TC 

HBVP3CSX 



FIGURE 5-2
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HBVP2CSX 

CHBV2 C 

HPBMUT 

FHBV14 TC , C 

1301 

HBVORFS GCAGC AGGTC TGGAGCAAAC ATTCTCGGGA CGGATAACTC TGTTGTTCTC TCCCGCAAAT 

XXHEPAV 

HBVGENl G 

HBVAYWMCG T C. .A 

HBVDNA A T C. .A . .G. .G. . . . 

XXHEPAV 

HPBHBVAA C 

AHBVl A T C 

FHBVl A T C 

FHBV2 A T C... .......... 

FHBV3 A T C 

FHBV4 A T C 

FHBV5 A T C N 

FHBV6 A T C 

FHBV7 A T C... .......... 

FHBV12 A T /...C... .......... 

FHBV13 A T C.[. .......... 

HBVP3CSX ][[ .......... .......... 

HBVP2CSX T. .......... ........ 

CHBV2 C C . . . C . . . . . . . . . . . . . . . . . . . 

HPBMUT A 

FHBV14 [ .t! . . . . .......... 

1361 

HBVORFS ♦ ATACATCGTT TCCATGGCTG CTAGGCTGTG CTGCCAACTG GATCCTGCGC GGGACGTCCT 

XXHEPAV A 

HBVGENl 

HBVAYWMCG , . [ [ . . . /a . . . . . . . . . . . . 

HBVDNA T. . .k .... . ..... . . . . . . T. . . . . . . . . . . . . 

XXHEPAV A .......... 

HPBHBVAA G 

AHBVl A .......... .......... 

FHBVl ... A. A .......... .......... .......... 

FHBV2 T. .A ^T.C 

FHBV3 AC .......... 

FHBV4 A .......... 

FHBV5 N..TC A T .......... 

FHBV6 A . . .G 

FHBV7 A [ .T. /...[.. 

FHBV12 A .1 -111111111 1111111111 1^11111111 

FHBV13 A. A , 

HBVP3CSX 

HBVP2CSX [ , ' .......... 

CHBV2 A .......... .......... 

HPBMUT . . . . . . . . . k . . . . . . . . . . 

FHBV14 .......... .......... 

1421 

HBVORFS TTGTTTACGT CCCGTCGGCG CTGAATCCCG CGGACGACCC TTCTCGGGGC CGCTTGGGGA 

XXHEPAV T AC 

HBVGENl .......... ........ kC 

HBVAYWMCG .C A ...... . .AC 

HBVDNA ] . C.. ....... ..... ...hC 

XXHEPAV . . . . . [ . . .T . . . . . . . . AC 

HPBHBVAA .......... ........AC 

FIGURE 5-3 . 
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AHBVl 

FHBVl 

FHBV2 

FHBV3 

FHBV4 

FHBV5 

FHBV6 

FHBV7 

FHBV12 

FHBV13 

HBVP3CSX 

HBVP2CSX 

CHBV2 

HPBMUT 

FHBVl 4 

HBVORFS 

XXHEPAV 

HBVGENl 

HBVAYWMCG 

HBVDNA 

XXHEPAV 

HPBHBVAA 

AHBVl 

FHBVl 

FHBV2 

FHBV3 

FHBV4 

FHBV5 

FHBV6 

FHBV7 

FHBVl 2 

FHBVl 3 . 

HBVP3CSX 

HBVP2CSX 

CHBV2 

HPBMUT 

FHBVl 4 

HBVORFS 
XXHEPAV 
HBVGENl 
HBVAYWMCG 
. HBVDNA 
XXHEPAV 
HPBHBVAA 
AHBVl 
FHBVl 
FHBV2 
FHBV3 
FHBV4 
FHBV5 
FHBV6 
FHBV7 
FHBV12 
FHBVl 3 
HBVP3CSX 
HBVP2CSX 




T T AC 

T T AC 

T T AC 

T T AC 

T C T AC 

T T . T AC 

\ T T AC 

T. T AC 



T T ..AC 



AC 

C. . T T AC 

AC 

1481 . . 

TCTTTCGTCC CCTTCTCCGT CTGCCGTTCC GTCCGACCAC GGGGCGCACC TCTCTTTACG 
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. . .c 








T- 




















.541 






CGGACTCCCC 


GTCTGTGCCT TCTCATCTGC 


CGGTCCGTGT GCACTTCGCT TCACCTCTGC 

. . - A 














A . 












. . .A G. ... 



















FIGURE 5-4 
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CHBV2 v.. A 

HPBMUT A 

FHBV14 A 

1601 

HBVORFS ACGTCGCATG GAGACCACCG TGAACGCC CACCACTTC TTGCCCAAGG TCTTACATAA 

XXHEPAV A A,CA 

HBVGENl A 

HBVAYWMCG A 

HBVDNA T. -AAGT C 

XXHEPAV A A.CA 

HPBHBVAA A 

AHBVl GGA.G 

FHBVl GAA.G 

FHBV2 T GGA.G 

FHBV3 GAA.G 

FHBV4 GGA-.G 

FHBV5 GGA.G ; 

FHBV6 GGA.. 

FHBV7 . GGA.G 

FHBV12 GGAA. C AC... 

FHBV13 GGAA. C.. 

HBVP3CSX 

HBVP2CSX 

CHBV2 A 

HPBMUT GC C.T.TA T.... 

FHBV14 : A 

1658 

HBVORFS GAGGACTCTT GGACTCTCAG CAATGTCAAC GACCGACCTT GAGGCATACT TCAAAGACTG 

XXHEPAV T. T 

HBVGENl T. T 

HBVAYWMCG 

HBVDNA C C 

XXHEPAV T. T 

HPBHBVAA T. T 

AHBVl T 

FHBVl T 

FHBV2 T 

FHBV3 T 

FHBV4 T 

FHBV5 T 

FHBV6 T 

FHBV7 T. .C 

FHBVl 2 GT 

FHBV13 TA 

HBVP3CSX T. T 

HBVP2CSX T 

CHBV2 ...T 

HPBMUT T. T 

FHBV14 T. T 

1718 

HBVORFS TTTGTTTA2.G GACTGGGAGG AGTTGGGGGA GGAGATTAGA TTAAAGGTCT TTGTACTAGG 

XXHEPAV A T 

HBVGENl A G 

HBVAYWMCG A . .G T.G T 

HBVDNA .G C G T 

XXHEPAV A \ T 

HPBHBVAA C T.A 

AHBVl A G..G 



FIGURE 5-5 
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FHBVl 

FHBV2 

FHBV3 

FHBV4 

FHBV5 

FHBV6 

FHBV7 

FHBVl 2 

FHBVl 3 

HBVP3CSX 

HBVP2CSX 

CHBV2 

HPBMUT 

FHBVl 4 

HBVORFS 

XXHEPAV 

HBVGENl 

HBVAYWMCG 

HBVDNA 

XXHEPAV 

HPBHBVAA 

AHBVl 

FHBVl 

FHBV2 

FHBV3 

PHBV4 

FHBV5 

FHBV6 

FHBV7 

FHBVl 2 

FHBVl 3 

HBVP3CSX 

HBVP2CSX 

CHBV2 

HPBMUT 

FHBVl 4 

HBVORFS 

XXHEPAV 

HBVGENl 

HBVAYWMCG 

HBVDNA 

XXHEPAV 

HPBHBVAA 

AHBVl 

FHBVl 

FHBV2 

FHBV3 

FHBV4 

FHBV5 

FHBV6 

FHBV7 

FHBVl 2 

FHBVl 3 

HBVP3CSX 

HBVP2CSX 

CHBV2 



.A 

.A 
.A 
-A 
.A 
.A 
.A 
.A 
.A 
.A 
.A 
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.G. 
.G. 
.G. 



.G 
.G 
.G 
.G 
• G 
.G 
.G 



.A 
.A 



. ; .c. 

.T.A. 



1778 
AGGCTGTAGG 



.T.A.T. 
.T.A.T. 
. . .T.G. 



A. 
A, 



.T. 
.T. 
.T. 



CATAAATTGG TCTGCGCACC AGCACCATGC AACTTTTTCA CCTCTGCCTA 



1838 

A-TCATCQ'CT TGTTCATGTC CTACTGTTCA AGCCTCCAAG CTGTGCCTTG GGTGGCTTTG 



.A 
.A 



FIGURE 5-6 
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HPBMUT 
FHBV14

1897

HBVORFS GGGCATGGAC ATTGACCCTT ATAAAGAATT TGGAGCTACT GTGGAGTTAC TCTCATTTTT

XXHEPAV . .T

HBVGENl

HBVAYWMCG . .T A . . .G.

HBVDNA

XXHEPAV

HPBHBVAA

AHBV1

FHBV1

FHBV2

FHBV3

FHBV4

FHBV5

FHBV6

FHBV7

FHBV12

FHBV13

HBVP3CSX T

HBVP2CSX

CHBV2

HPBMUT

FHBV14

1957

HBVORFS GCCTTCTGAC TTTTTTCCTT CGGTACGAGA TCTTCTAGAT ACCGCCTCAG CTCTGTATCG

XXHEPAV

HBVGENl

HBVAYWMCG

HBVDNA

XXHEPAV

HPBHBVAA

AHBV1

FHBV1

FHBV2

FHBV3 C.C.

FHBV4 . .0.

FHBV5

FHBV6

FHBV7

FHBV12

FHBV13

HBVP3CSX CA

HBVP2CSX

CHBV2 A

HPBMUT

FHBV14

2017

HBVORFS GGATGCCTIA GAGTCTCCTG AGCATTGTTC ACCTCACCAT ACTGCACTCA GGCAAGCAAT

XXHEPAV

HBVGENl

HBVAYWMCG

HBVDNA A. .A

XXHEPAV

HPBHBVAA

AHBV1

FHBV1

FIGURE 5-7
FHBV2 ...A

FHBV3. ...A G 

FHBV4 A.. A : 

FHBV5 ...A C. 

FHBV6 ...A.. 

FHBV7 ...A.. 

FHBV12 ...A CA 

FHBV13 ...A AC 

HBVP3CSX 

HBVP2CSX A 

CHBV2 A 

HPBMUT ...A 

FHBV14 ...A 

2077 

HBVORFS TCTTTGCTUG GGGGAACTAA TGACTCTAGC TACCTGGGTG GGTGTTAATT TGGAAGATCC 

XXHEPAV A. .G G . 

HBVGENl 

HBVAYWMCG T C G C. 

HBVDNA . . .C G AT A 

XXHEPAV A..G G 

HPBHBVAA ...C T A T. C G 

AHBVl 

FHBVl A 

FHBV2 

FHBV3 

FHBV4 .,.A A 

FHBV5 C C... 

FHBV6 . 

FHBV7 

FHBVl 2 

FHBVl 3 

HBVP3CSX A ...CT A A C 

HBVP2CSX C C 

CHBV2 G C G 

HPBMUT . . .G TG A GC 

FHBV14 . . .A T C G 

2137 

HBVORFS agcatctag:; gacctagtag tcagttatgt caacactaat atgggcctaa agttcaggca 

XXHEPAV .AT. . .C A 

HBVGENl T. .T T 

HBVAYWMCG .A C T 

HBVDNA . . ,G T A 

XXHEPAV -AT...C A 

HPBHBVAA .A T 

AHBVl A 

FHBVl A 

FHBV2 A . -A. AT AT 

FHBV3 A ; 

FHBV4 A 

FHBV5 .\ 

FHBV6 A : 

FHBV7 A 

FHBV12 A 

FHBVl 3 A 

HBVP3CSX A A T 

HBVP2CSX T 

CHBV2 ..T...C T C C T 

HPBMUT .A C A.T. ...A T. G 

FIGURE 5-8 
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FHBV14 .A C T C. T T 

2197 

HBVORFS ACTATTGT'^G TTTCACATTT CTTGTCTCAC TTTTGGAAGA GAAACAGTCA TAGAGTATTT 

XXHEPAV T. 

HBVGENl ...T G 

HBVAYWMCG G G 

HBVDNA T..A C. .T G..T..AC ,T. .A 

XXHEPAV , T 

HPBHBVAA G 

AHBVl ...C ....C..T. 

FHBVl ...C ; C..T 

FHBV2 ...T C..T 

FHBV3 ...T , C..T 

FHBV4 . . .C .C. .TC 

FHBV5 . . .C C. .T 

FHBV6 ...M , T..A C..Y G. .T. . AC .C . .A 

FHBV7 ...C C..T 

PHBV12 ...C . T 

FHBV13 ...C C..T 

HBVP3CSX G 

HBVP2CSX . . .C G, .T. 

CHBV2 T. .T. .AC AT. .T 

HPBMUT TC 

FHBV14 y T.T G G. . .C .T 

2257 

HBVORFS GGTGTCTrrC GGAGTGTGGA TTCGCACTCC TCCAGCTTAT AGACCACCAA ATGCCCCTAT 

XXHEPAV T 

HBVGENl 

HBVAYWMCG T A 

HBVDNA ...C 

XXHEPAV T 

HPBHBVAA T 

AHBVl 

FHBVl 

FHBV2 

FHBV3 

FHBV4 A 

FHBV5 

FHBV6 .,.c 

FHBV7 

FHBV12 A 

FHBVl 3 

HBVP3CSX A 

HBVP2CSX 

CHBV2 T . .T 

HPBMUT A 

FHBV14 

2317 

HBVORFS CTTATCAACA CTTCCGGAAA CTACTGTTGT TAGACGA CGAGGCA GGTCCCCTAG 

XXHEPAV G. 

HBVGENl G 

HBVAYWMCG G. , .TG 

HBVDNA *. . ..... .G.A 

XXHEPAV G 

HPBHBVAA G 

AHBVl .c! 

FHBVl .C. ] [, 

FHBV2 .C 

FIGURE 5-9 
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FHBV3 .C 

FHBV4 -C 

FHBV5 .C 

FHBV6 -C 

FHBV7 .C 

FHBV12 .C 

FHBV13 .C G 

HBVP3CSX A •. A 

HBVP2CSX G A 

CHBV2 G. ..GG.A..A A.CGG GAC 

HPBMUT G 

FHBV14 .... G A 

2371 

HBVORFS AAGAAGAACT CCCTCGCCTC GCAGACGAAG ATCTCAATCG CCGCGTCGCA GAAGATCTCA 

XXHEPAV G 

HBVGENl 

HBVAYWMCG G 

HBVDNA G 

XXHEPAV G 

HPBHBVAA G A A 

AHBVl G 

FHBVl G 

FHBV2 G A. 

FHBV3 A G. A GAG 

FHBV4 G. 

FHBV5 G . 

FHBV6 G 

FHBV7 G 

FHBVl 2 G 

FHBV13 G . 

HBVP3CSX 

HBVP2CSX 

CHBV2 C 

HPBMUT C GC 

FHBVl 4 G 

2431 

HBVORFS ATCTCGGGAA TCTCAATGTT AGTATTCCTT GGACTCATAA AGTGGGTAAC TTTACGGGGC 

XXHEPAV G A 

HBVGENl G A A 

HBVAYWMCG G A 

HBVDNA G G T 

XXHEPAV G A 

HPBHBVAA C G A... 

AHBVl G G.. • . T..T. 

FHBVl ^ 

FHBV2 

FHBV3 C G G T. .T. 

FHBV4 

FHBV5 

FHBV6 G G T. .T. 

FHBV7 

FHBV12 

FHBV13 

HBVP3CSX 

HBVP2CSX T G. . . . .A. . . 

CHBV2 C G A 

HPBMUT G G A 

FHBV14 ;.. C • G..,. .A 
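
Figure 5 uses the common dot notation for alignments: each row shows a base only where it differs from the reference row (HBVORFS), and a dot where it is identical. The sketch below shows how such a row can be expanded and how differences can be listed by genome position. It is an illustration under stated assumptions: the function names are hypothetical, and the example row is a made-up full-width row, because the column spacing of the scanned figure has not survived extraction and the rows printed above cannot be expanded reliably.

```python
# Illustrative sketch only: function names and the example row are hypothetical.

def expand_row(reference, row):
    """Expand a dot-notation row into a full sequence.

    Spaces (used to group bases in tens) are ignored; '.' means
    "same base as the reference at this column".
    """
    ref = reference.replace(" ", "")
    var = row.replace(" ", "")
    if len(ref) != len(var):
        raise ValueError("row and reference must cover the same columns")
    return "".join(r if v == "." else v for r, v in zip(ref, var))


def differences(reference, row, start):
    """List (position, reference_base, variant_base) for every differing column.

    start is the genome position of the first column of the block.
    """
    ref = reference.replace(" ", "")
    full = expand_row(reference, row)
    return [(start + i, r, v) for i, (r, v) in enumerate(zip(ref, full)) if r != v]


# First ten reference bases of the block starting at 1241, with a hypothetical
# row carrying T at 1249 and C at 1250.
reference = "GAACCTTTCT"
row = "........TC"
print(expand_row(reference, row))
print(differences(reference, row, start=1241))
```

Reporting differences as (position, reference base, variant base) keeps the output in the same terms as the position-based notation used in the claims, such as T(1249) and C(1250).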
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1643 1686 

| (CURS starts) | (Enh-II starts)

Con-ayw CAAGGTCTTACATAAGAGGACTCTTGGACTCTCTGTAATGTCAACGACCGACCTTGAGGCATACTT

Con- adw 

FHBV-1 

FHBV-4 

FHBV-5 

FHBV-8.1 

FHBV-8.5 

CHBV-1 

FHBV-9 

FHBV-12

FHBV-14

CHBV-2

FHBV-15

FHBV-16

1-40 
1-59 
1-69 
1-89 
1-95 
1-105 
1-177 
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1743 

^ (CURS ends, BCP starts) 

Con-ayw CAAAGACTGTTTGTTTAAAGACTGGGAGGAGCTGGGGGAGGAGATTAGATTAAAGGTCTTTGTATT 

Con-adw G G G 

FHBV-1 X G-^ T C- 

FHBV-4 X C T-A-T-A 

FHBV-5 G T C T-A-T-A

FHBV-8.1 T T-G

FHBV-8.5 -G G A C G T T

CHBV-1 G G

FHBV-9 G G G

FHBV-12 G T G

FHBV-14 G T C T-G

CHBV-2 G T C T-A-T-A~C—

FHBV-15 G T C ^ C- 

FHBV-16 G : T 

1-40 AT T XXXXXXXXX-A— A 

1-59 C T C- 

1-69 X r- C- 

1-89 X T-A C- 

1-95 X T-A C- 

1-105 X C T-A C- 

1-177 C T A C- 
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4 (Enh-II ends) ^ 

Con-ayw AGGAGGCTGTAGGCATAAATTGGTCTGCGCACCAGCACCATGCAACTTTTTCACCTCTGCCTAA—TC 

Con-adw C 
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FHBV-4 , T 

FHBV-5 T ' 
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FHBV-8.5 • 
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FHBV-12 C T C 

FHBV-14 C A- 
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Cases Luciferase Variations in the BCP. 
value xlO^ 

Con-acfM^ 1 .50 



FHBV-1 



FHBV-4 



11.51 



Ti762 ^1826 



Cl753 A1757T1762 Ai764, 
Ti766 Ai768Ti810 



FHBV-5 

FHBV-14 
CHBV-2 
FHBV-1 5 



10,25 C1753 A1757 Tl762 A1764, 

Tl766 Ai768Ti810 



11.39 

9.68 

7.98 



A1757 Ti764 Gl766 Al834 
Tl845 

C1753 A1757T1762 Ai764 
Ai768 C1771 TI8IO 

Tl678 Ci7S2 A 1757 
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